Predicting essential genes in fungal genomes.

Authors

  • Michael Seringhaus
  • Alberto Paccanaro
  • Anthony Borneman
  • Michael Snyder
  • Mark Gerstein
Abstract

Essential genes are required for an organism's viability, and the ability to identify these genes in pathogens is crucial to directed drug development. Predicting essential genes through computational methods is appealing because it circumvents expensive and difficult experimental screens. Most such prediction is based on homology mapping to experimentally verified essential genes in model organisms. We present here a different approach, one that relies exclusively on sequence features of a gene to estimate essentiality and offers a promising way to identify essential genes in unstudied or uncultured organisms. We identified 14 characteristic sequence features potentially associated with essentiality, such as localization signals, codon adaptation, GC content, and overall hydrophobicity. Using the well-characterized baker's yeast Saccharomyces cerevisiae, we employed a simple Bayesian framework to measure the correlation of each of these features with essentiality. We then employed the 14 features to learn the parameters of a machine learning classifier capable of predicting essential genes. We trained our classifier on known essential genes in S. cerevisiae and applied it to the closely related and relatively unstudied yeast Saccharomyces mikatae. We assessed predictive success in two ways: First, we compared all of our predictions with those generated by homology mapping between these two species. Second, we verified a subset of our predictions with eight in vivo knockouts in S. mikatae, and we present here the first experimentally confirmed essential genes in this species.
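As a rough illustration of the kind of per-feature analysis the abstract describes, the sketch below computes one sequence feature (GC content) and a smoothed log-likelihood ratio for how strongly a binary feature is associated with essentiality. This is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the add-one smoothing, and the toy labels are all hypothetical, and the abstract does not specify how the Bayesian framework was actually formulated.

```python
import math


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for an empty string)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def log_likelihood_ratio(feature_present, essential, pseudocount=1.0):
    """Return log[P(feature | essential) / P(feature | non-essential)].

    feature_present and essential are equal-length sequences of 0/1 values,
    one entry per gene. The pseudocount (an assumption of this sketch) keeps
    the ratio finite when a feature never occurs in one class.
    """
    n_ess = sum(essential)
    n_non = len(essential) - n_ess
    f_ess = sum(1 for f, e in zip(feature_present, essential) if f and e)
    f_non = sum(1 for f, e in zip(feature_present, essential) if f and not e)
    p_ess = (f_ess + pseudocount) / (n_ess + 2 * pseudocount)
    p_non = (f_non + pseudocount) / (n_non + 2 * pseudocount)
    return math.log(p_ess / p_non)


# Toy, made-up labels for four hypothetical genes (not data from the paper).
has_signal = [1, 1, 0, 0]    # e.g. presence of some localization signal
is_essential = [1, 1, 0, 0]  # known essentiality in the training organism
llr = log_likelihood_ratio(has_signal, is_essential)
```

A positive ratio means the feature is seen more often among essential genes in the training set, and a negative ratio means the opposite. Under a naive independence assumption, such per-feature scores could be summed to rank genes, though the abstract does not say which classifier the authors used.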

Similar resources

Fungome: Annotating proteins implicated in fungal pathogenesis

Sequencing the genomes of different pathogenic fungi has produced a plethora of genetic information. These "omics" data might be of great interest for probing strain diversity, identifying virulence factors and complementary genes in other fungal species, and, importantly, predicting the role of proteins specific to pathogenesis in humans. We propose a component called "fungome" for those fungal proteins impli...

Exploring the Optimal Strategy to Predict Essential Genes in Microbes

Accurately predicting essential genes is important in many aspects of biology, medicine and bioengineering. In previous research, we have developed a machine learning based integrative algorithm to predict essential genes in bacterial species. This algorithm lends itself to two approaches for predicting essential genes: learning the traits from known essential genes in the target organism, or t...

Acquired Antimicrobial Resistance Genes of Escherichia coli Obtained from Nigeria: In silico Genome Analysis

Background: Antimicrobial resistance is a global problem with enormous public health and economic impact. This study was carried out to get an overview of acquired antimicrobial resistance gene sequences in the genomes of Escherichia coli isolated from different food sources and the environment in Nigeria. Methods: To determine the prevalence of acquired antimicrobial resistance genes, genome asse...

In Silico Sequence Analysis Reveals New Characteristics of Fungal NADPH Oxidase Genes

NADPH oxidases (Noxes), transmembrane proteins found in most eukaryotic species, generate reactive oxygen species and are thereby involved in essential biological processes. However, the fact that genes encoding ferric reductases and ferric-chelate reductases share high sequence similarities and domains with Nox genes represents a challenge for bioinformatic approaches used to identify Nox-enco...

Using Population and Comparative Genomics to Understand the Genetic Basis of Effector-Driven Fungal Pathogen Evolution

Epidemics caused by fungal plant pathogens pose a major threat to agro-ecosystems and impact global food security. High-throughput sequencing enabled major advances in understanding how pathogens cause disease on crops. Hundreds of fungal genomes are now available and analyzing these genomes highlighted the key role of effector genes in disease. Effectors are small secreted proteins that enhanc...

Journal title:
  • Genome Research

Volume 16, Issue 9

Pages  -

Publication date 2006